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Abstract 

Spectrum management has been identified as a crucial step towards enabling the technology of a 
cognitive radio network (CRN). Most of the current works dealing with spectrum management in the 
CRN focus on a single task of the problem, e.g., spectrum sensing, spectrum decision, spectrum sharing 
or spectrum mobility. In this two-part paper, we argue that for certain network configurations, jointly 
performing several tasks of the spectrum management improves the spectrum efficiency. Specifically, our 
aim is to study the uplink resource management problem in a CRN where there exist multiple cognitive 
users (CUs) and access points (APs). The CUs, in order to maximize their uplink transmission rates, have 
to associate to a suitable AP (spectrum decision), and to share the channels used by this AP with other 
CUs (spectrum sharing). These tasks are clearly interdependent, and the problem of how they should be 
carried out efficiently and in a distributed manner is still open in the literature. 

In this first part of the paper, we focus on the problem of spectrum sharing in a multi-channel CRN 
with a single AP. The insight gained from the analysis of this simpler network is used as the building 
block for analyzing the multiple AP network in the second part of this paper We formulate the single AP 
spectrum sharing problem into a non-cooperative power allocation game, in which individual CUs aim 
at maximizing their transmission rate by finding the suitable power allocation on the available channels. 
Interestingly, we discover that the set of equilibrium solutions of this game possesses the following 
optimality properties: 1) any equilibrium solution is the optimum input power allocation scheme in 
the sense that it maximizes the sum rate of the network if joint decoding at the AP is employed; 
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2) asymptotically, when the number of channels becomes large, any equilibrium solution becomes a 
Frequency Division Multiple Access (FDMA) strategy, and the maximum system sum rate is achieved 
without joint decoding. We subsequently propose a set of algorithms for the CUs in the network to 
achieve such equilibrium solutions in distributed fashion. 

I. Introduction 

A. Motivation and Related Work 

The problem of distributed spectrum management in the context of CRNs has been under intensive 
research recently. As pointed out by the authors of ||2], spectrum management needs to address four main 
tasks: 1) spectrum sensing, techniques that ensure CUs to find the unused spectrum for communication; 
2) spectrum decision, protocols that enable the CUs to decide on the best set of channels; 3) spectrum 
sharing, schemes that allow different CUs to share the same set of channels; 4) spectrum mobility, rules 
that require the CUs to leave the channel if licensed users are detected. Many efforts have been devoted 
to providing solutions to the individual tasks listed above. However, as we will see in this two-part 
paper, in some CRN scenarios, several of the above tasks become interdependent, and the CUs have to 
perform these tasks jointly to achieve best performance. We thus propose to provide solutions for the 
joint spectrum decision and spectrum sharing problems in a multi-channel multi-user CRN. 

In this two-part paper we focus on investigating an important CRN configuration where such joint 
spectrum decision and spectrum sharing is desirable. Consider a network with multiple CUs and APs. 
The APs operate on non-overlapping spectrum bands, and the CUs need to connect to one of the APs 
for communication. At this stage the CUs essentially perform a spectrum decision task, in which they 
decide on the best spectrum band to use, i.e., the best AP to connect to. After the AP selection, the CUs 
can use multiple channels belonging to the associated AP concurrently for transmission, but different 
CUs interfere with each other if they use the same channel. At this stage the CUs perform a spectrum 
sharing task, in which multiple CUs use the same spectrum band for communication. This network is a 
generalization of the single AP network considered in previous literature, e.g., ||3], El and ||5]. It also 
bears sufficient similarity to the operational model of the IEEE 802.22 cognitive radio standard ||6], in 
which multiple service providers install their respective APs to serve the same geographic region. In the 
considered network, the CUs face the spectrum decision problem when they select the AP, and they face 
the spectrum sharing problem when they try to dynamically allocate their communication power across 
the channels belonging to the selected AP. Clearly, these two problems are strongly interdependent, as 
on the one hand a particular CU has to select an AP before it can share the spectrum assigned to 
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this AP with all the other CUs associated with it; on the other hand, after sharing the spectrum, an 
individual CU may have the incentive to switch to a different AP if it perceives that such action will 
increase its communication rate. A poor spectrum decision and spectrum sharing scheme will not only 
lead to unsatisfactory performance for individual CUs, but also result in an unstable system in which 
CUs are constantly unsatisfied with their current communication rates and consequently changing their 
AP associations and power allocation indefinitely. 

In the first part of this paper, we focus on the spectrum sharing aspect of the above problem. Specifically, 
we study the uplink power allocation problem in a CRN with a single AP. This problem is important in 
its own right, and the insight gained from studying this problem serves in studying the more complicated 
network with multiple APs that we discuss in the second part of this paper. 

Centralized strategies for resource allocation in multi-carrier single AP network has been extensively 
studied. In Q and lH, a joint sub-carrier assignment and power allocation algorithm is proposed for 
downlink orthogonal frequency division multiplexing (OFDM) network with the objective to optimize 
utility functions related to throughput and fairness. It has been shown that when the utility function 
is properly chosen, the optimum access strategy is FDMA, and the downlink throughput can achieve 
Shannon capacity. In [9l, the authors formulate the optimum (in the sense of minimizing the received 
mean square error) linear transceiver design problem in a multiple access (MAC) intersymbol interference 
(ISI) channel into an optimum uplink subcarrier allocation and power loading problem, and propose a 
strongly polynomial algorithm to determine such optimum strategy. lITOl proposes a numerical method 
to compute the capacity of FDMA MAC channel as well as to (near-) optimally assign the channels. 
The proposed method assigns the channel to different users by solving a (convexly relaxed) optimization 
problem. lITTI and |[T2l are two recent developments for uplink/downlink resource allocation in multi- 
carrier systems. The uplink optimality of OFDMA system has been discussed in |[T3l . in which the 
authors derived sufficient conditions of the channel state as well as the received signal noise ratio for 
the OFDMA system to achieve maximum uplink system sum rate. We note that the centralized schemes 
usually assume that the AP carries out all the necessary computations and enforces the resultant optimum 
policies among the mobile users in the network. 

However, such centralized scheme may not be applicable in networks where individual mobile users 
are autonomous or selfish and have the intention and the ability to deviate from the centralized poUcies 
(e.g., in the cognitive radio network). Consequently various distributed algorithms are proposed in the 
literature, for example, ||T41 . lITSl and |@]. In ||T41 . a distributed power allocation scheme is proposed for 
uplink OFDM systems where the channel state is simplified to having only discrete levels. Notably, this 
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scheme only requires that each user has the knowledge of its own channel state information (CSI), thus 
the signaling needed for the users to obtain the global CSI (as required by the algorithms proposed in, 
say, 131) from the AP is greatly reduced. |[T5l is a recent work considering the uplink dynamic spectrum 
sharing problem in a multi-carrier multiple service provider cognitive network. The authors developed a 
distributed algorithm for the users to jointly choose the size of the spectrum as well as the amount of 
power for transmission. However, in this work the channel is considered to be flat for each user, as a 
result, the users only need to select the size of spectrum they need (because any portions of the spectrum 
of the same size is equivalent to the users), which greatly simplifies the analysis. In [4], the problem 
of distributed energy-efficient power control in uplink multi-carrier CDMA system is considered. The 
authors formulate the problem into a game-theoretic framework, and a distributed algorithm is proposed 
in which each user transmits only on its "best" channel. ||T6l considered a generalization of a multi- 
carrier MAC channel, and proposed a distributed iterative water-filling (IWF) algorithm to compute the 
maximum sum capacity of the system. It is worth mentioning that the algorithms proposed in IH and 
|[T6l both require that the users update their transmission strategies sequentially, thus may result in slow 
convergence when the number of users in the system is large. 

In this first part of the paper, we formulate the uplink spectrum sharing problem into a non-cooperative 
game, in which the CUs in the network try to maximize their individual transmission rate. Due to the 
structure of the considered multi-channel network, we are able to identify the proposed game as a potential 
game lITTl . in which the players in the game, although selfish by nature, behave as if they aim at jointly 
optimizing an objective function (which is called the potential function). Such underlying structure of the 
game allows us to characterize many optimality properties of the equilibrium solution. In particular, the 
maximum value of the potential function equals to the maximum sum rate achievable for the network. 
As a result, the CUs can be viewed as jointly optimizing the system sum rate. As far as we know, such 
interesting relationship between a spectrum sharing game and the system sum rate in multi-carrier single 
AP network has not been shown in the literature. We then propose three algorithms with convergence 
guarantees that allow the CUs to reach the equilibrium solution(s) in a distributed fashion. For the sake of 
rigor, we categorize our algorithms either as "weak convergent", in which the individual CUs' strategies 
converge to the set of equilibria, or as "strong convergent", in which the individual CUs' strategies 
converge to an equilibrium point. Such distinction is technical yet necessary, as we will point out in 
section ITV-AI for the reason that unlike most other spectrum sharing games (e.g., lITSl . |[T9l ). our game 
generally admits a connected set of equilibria, consequently it is possible, at least theoretically, for the 
algorithm to converge to the set of equilibria without converging to any equilibrium point. 
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B. Organization of This Work and Notations 

This part of the paper is organized as follows. In section HI] we present the system under consideration, 
and formulate the spectrum sharing problem into a non-cooperative game. In section Jill we give detailed 
analysis regarding to the property of the NE of the game. In section |lVl we propose distributed algorithms 
and study their convergence properties. In section |V] we show the numerical result. This part of the paper 
concludes in section |Vll 

Some notations used in this paper are specified as follows: we use bold lowercase and uppercase letters 
for vectors and matrices, respectively. The [i,])*^ element of a matrix X is denoted by [X]j j. For a 
symmetric matrix X, X ^ signifies that X is positive semidefinite. The trace of a matrix is denoted 
by tr(X); the determinant of a matrix is denoted by |X|. I.„ is used to denote a n x ?i identity matrix. 
For a vector x, diag{ii) represents a diagonal matrix with its diagonal entries equal to the entries of the 
vector X. We use /5(X) to denote the spectral radius of the matrix X. 

II. Problem Formulation 

A. System Model 

We consider a wireless network with a set AA = {1, 2, • • • , N} CUs and a single AP. Let us normalize 
the total available bandwidth to 1, and divide it equally into K channels; let the set /C = {1, 2, • • • , K} 
represents the set of available channels. 

The foUowings are our main assumptions of the network. 
A-1) The available spectrum can be used exclusively by the CRN, for a relative long period of time. 
A-2) Each CU can concurrently use all the channels of the AP for transmission, if desired. 
A-3) The AP is equipped with single-user receivers. 

Assumption A-1) can be achieved either under the spectrum property right model in which the licensed 
networks sell or lease the spectrum to the cognitive network for a period of time for exclusive use, or 
under the situation that the cognitive network exploits relatively static spectrum white spaces unused by 
local TV broadcast EOl . Assumption A-3) is congruent with the lack of coordination of the CUs, because 
an individual CU essentially treats other CUs' transmission as noises. It also allows for implementation 
of low-complexity receivers at the AP. This assumption is generally accepted in designing distributed 
algorithm in multiple-access channels ||3l. 

Let Xi{k) denote the complex Gaussian signal transmitted by CU i on channel k; let Pi{k) = E[{xi{k))'^] 
denote the transmitted power of CU i on channel k. Let z{k) ^ CN{0, n(/c)) denote the white complex 
Gaussian environment noise experienced at the receiver of AP with mean zero and variance n{k). Let 
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hi{k) denote the channel gain coefficient between CU i and the AP on channel k. The signal received 
at the AP on channel k, denoted by y{k), can then be expressed as: 



JV 



y{k)=J2^^ik)hi{k)+z{k). (1) 



i=l 



Define = [xi{l),--- ,Xi{K)Y; z = ,z{K)Y; Hj = diag [hi{l), ■ ■ ■ ,hi{K)]. Then the 

received vector signal y = [y(l), • • • ,y{K)Y can then be expressed concisely as: 



N 



HjXi + z. (2) 



i=l 

Letpi = [pi{l),--- , Pi (_?;:)]■'■ be CUz's transmission power profile; let p_j = [Pi,--- ,Phi,Pi+i, ■ ■ ■ ^PnY 
be the transmission profile of all other CUs except CU i; let p = [pj, • • • , p]^] ^ be the system power 
profile. Define pi to be CU i's maximum allowable transmission power, then its feasible space can 
be expressed as Vi = |pj : Pi > 0, ^^^^^ pi(A;) < In this network when single-user receiver is 
employed at the AP, and when assuming other CUs' power profiles are fixed, the CU i's maximum 
achievable rate can be expressed as 11211 : 

We assume that each CU i € M has the knowledge of its own channel coefficients {hi{k)}keJC^ 
and the quantity '^j^iPj{k)\hj{k)\'^ + n{k) on every channel, which represents the sum of noise plus 
interference on each channel. This information can be obtained by the AP and fed back to each CU i, 
as suggested in We do not assume that individual CU has any information regarding the other CUs' 
channel coefficients; nor do we require that individual CU i knows the power budget {pj}j^i or the 
power allocations p_j of other CUs. 

The problem that the CUs are facing is that under the above sets of system constraints, how should 
they decide on the policy for efficiently sharing of the available spectrum in a distributed manner? In the 
following subsection, we formulate such spectrum sharing problem into a game-theoretical framework. 

B. A N on-Cooperative Game Formulation 

In order to facilitate the development of a distributed algorithm, we model each CU as selfish agent, 
and its objective is to maximize its own transmission rate. More specifically, when p„j is fixed, CU i is 
interested in solving the following optimization problem: 



max 



1 ^,.(. , P^{k)\K{k)\'' 



P^^V.K^ "\ nik) + Ej^^Pjik)\h,{k)\ 
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The solution to this optimization problem is the well-known single-user water-filling solution lIlTI : 

n{k) + Y.,^^p,{k)\h^{k)f ' 



y keIC (5) 



MkW 

where > is the dual variable for the sum-power constraints. 

We introduce a non-cooperative spectrum sharing game where the players of the game are the CUs in 
the network, the utility of each player is its achievable transmission rate, and the strategy of each player 
is its transmit power profile. We denote this game as G = {Af,V, {Ri{-)}ieAf}^ where V = IljeAA^j 
the joint feasible region of all CUs. 

The Nash Equilibrium (NE) of the above game is defined as the strategies {p,*}jeAr satisfying ll22l : 

P* e argp^gp^ maxiii(pj, plj) V i G A/". (6) 

Intuitively, a NE of the game is a stable point of the system where no player has the incentive to 
deviate from its current strategy. In the following sections, we will first set out to analyze the properties 
of the set of NE of game G, and then propose distributed algorithms to reach the set of NE. 

III. Characteristics and Optimality of the NE 

A. NE as Maximizers of the Potential Function 

In order to facilitate the analysis, we introduce the notion of a potential function of the game G. Define 
a concave function P : P — M: 

^(p) - E {^^^) + E \h^mh^{k)^ - ^og{n{k))^ . (?) 

We can readily observe that the following identity is true for alH G and k £ IC: 

dPjp) ^ dR^{pi,p.i) 

dpi{k) pi{k) 
or similarly, for any pj and pj G Vi and for fixed p_j. 



(8) 



i?i(pi, p_i) - Ri{pi, p_i) = P{pi,p-i) - P{pi, p^i). (9) 

We call the function P{p) the potential function associated with the game G. Due to the properties dSjl 
and (|9l), we call the game G a potential game. From lITTl . ll23l and ll24l . we have the following theorem. 

Theorem 1: A potential game, say Q = {$7, x, {C^jjtgn}. admits at least one pure-strategy NE. If the 
potential function P(.) associated with the potential game is concave, then a feasible strategy x* G x is 
a NE of the game if and only if it maximizes the potential function, i.e., x* G arg^g^ maxP(x). 

In light of the above theorem, we immediately have the following Corollary. 
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Corollary 1: p* is a NE of the game G if and only if 

p* Gargmax-i J] |^log l^{k) + Y^%{k)\''pi{k)^ -\o^{n{k))^ . (10) 

B. NE as Optimum Input Strategies that Maximize the Network Sum Rate 

It turns out that the potential function ([7]l has a nice physical interpretation. We show in this subsection 
that it can be related to the maximum sum rate achievable for the considered network. In order to make 
the above statement precise, we digress a little to consider the following sum capacity maximization 
problem of the vector MAC system. 

Consider a vector MAC communication system |fT6l with users and a single AP. The users and 
the AP are both equipped with K antennas. Assume the available bandwidth is -j^. Let be user /c's 
transmitting vector signal; let Hfc he & K K matrix that represents the communication channel between 
user k and the AP; let y be the aggregated received signal at the AP; let z be the additive Gaussian noise 
with covariance matrix . Then y can be expressed as follows Q: 

y = J]H,Xi+^. (11) 

Let I]j = £^[xfcx^] denote user i's transmission/input covariance. The users are constrained in their 
individual power output, i.e., the input covariance of the users should satisfy tr{Y,i) < pi, V i G M. The 
capacity-achieving input distribution is known to be a complex Gaussian distribution, and the optimum 
set of input covariances {Yl*}i^_^f that maximize the capacity of this system can be found by solving the 
following problem |[T6l . lIlTI : 



max C(S) = ^\ \og 
K \ 



log|S,| (12) 



s.t. trCSi) <pi V i G TV 
^ y ieAf 

where X) = {Sj}jg^ indicates the joint transmission covariance matrix. Notice, that the objective function 
C(5]) is a concave function ||2TI . hence, it admits a unique maximum value in the feasible region. 
However, this function is generally not strictly convex, and there are a (connected) set of optimum points 
that achieve such optimum value. 

'clearly the received signal y has a similar form with that of the received signal in considered single AP network (cf. (|2j), 
consequently, the results derived in this vector MAC system are instrumental in analyzing our single AP network. 
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We have the following proposition regarding the input covariance matrices Xl* that maximizes the sum 
rate of the system. 

Proposition 1: If Hj is diagonal for each i, and if Sl^ is diagonal, then there must exist a set of 
diagonal input covariance matrices {S*}jg_^ that is the optimum solution of the problem (I12I ). 
Proof: We prove this proposition by contradiction. Consider the following convex problem: 

1 



max C(S) 



K 



log 



log|S, 



(13) 



s.t. tr(5]i) < Pi, M i£N 
^ 0, y ieAf 
I]j is diaganol, V i G M. 

Suppose the set of diagonal matrices {S*}jg7\^ is an optimum solution of the problem ([T3] ). but it is not 
an optimum solution of the problem ([T2l ). For each user i G M, let Sj be the (unique) solution of the 
following optimization problem: 



max — I log 



log|S, 



(14) 



s.t. tr{Tii) < Pi 
y 0. 

Then from Theorem 1 of llT6l . and the assumption that is not optimum solution to the problem 

(fT2l) . there must be at least one user i that, by changing its input covariance matrices from to Hi, it 
can strictly improve the objective function, i.e., 3 i G M, such that C(Sj,Slj) > C(5]*). 

Let us now find the optimum solution X)j of the problem ([141 ). Let N = HjS*Hj + S^, then N 
is diagonal because Xl^, {S*}j^j and {Hj}j^j are all diagonal. Clearly it is also semi-definite. Define a 
matrix Q with its elements satisfying: 

1 if {N]i,i ^ 



[Qki 



otherwise 



Then solving problem ([141 ) is equivalent to maximizing the function log QHjSjHjQ + I 
inequality ||2T1 and the fact that QHj is diagonal, we have that: 



(15) 



. From Hadamard's 



log 



QH,S,HJQ + l\<Yl log (l + mlkMlki^ikk 

keK. 
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where the equaUty is achieved if and only if Sj is diagonal, and satisfying < J2k=ii'^i]k,k < Pi- 
Consequently, we conclude that the optimum covariance matrix Sj of the problem (fT4l l is a diagonal 
matrix. However, this contradicts the assumption that the set of covariance matrices is an 

optimum solution of the problem (fT3] ). because the new set of covariance matrices {Slj, Xlj} is a feasible 



The proof of Proposition [T] points out that when {Hj}jg^ and "E^ are diagonal, any solution to the 
optimization problem ([T3] ) must be an optimal solution to the original problem (IT2] |. In this case, the 
users only need to select the amount of power on each antenna (power loading transmission scheme) to 
achieve the maximum sum rate, and the objective function C(5]) of ([T3] ) can be reduced to: 



Clearly, when the channel matrices and the noise matrix are all diagonal, and the users choose the diagonal 
transmission strategy, the A^-user A'-antenna vector MAC channel introduced above is equivalent to the our 
previously considered A^-CU i^T-channel single AP network (as can be seen from the equivalence of equa- 
tion ^ and (fTTI)). with the following correspondence of parameters: 5]j = diag{[pi{l), • • • ,pi{K)]) V i G 
M, Hi = diag{[hi{l),--- ,hi{K)]) V « G and = diag{[n{l), ■ ■ ■ ,n{K)]). 

The vector MAC capacity maximization problems ([T2l ) or ([T3T l can be readily seen as equivalent to the 
potential function maximization problem ([TOl l. and the maximum value of the potential function P{p), 
say P*, corresponds to the maximum sum rate achievable for the considered N-CU, K-channel, single 
AP network. Corollary [T] implies that any NE of the game G maximizes the potential function P{p) 
among the feasible solutions. Consequently it is also an optimal solution of (fT3] ). thus can be viewed as a 
set of optimal input covariance matrices (or optimum power loading scheme because of the diagnonality) 
that maximizes the achievable sum rate of the system. 

However, this result does not imply that the sum of the CUs' rate at a NE achieves the optimal system 
sum rate. We should point out here that in general, one needs to have both optimal transmission strategy 
and optimal receiving strategy to be able to achieve the MAC capacity. In our context, this is to say 
that in general, at a NE of the game G, although the CUs' transmission strategy is optimal, the sum of 
individual CUs' rate should be less than the maximum achievable sum rate of the system (notice, that in 
our considered network, the assumption is that only single-user receiver is implemented at the AP, which 
is obviously not an optimal receiving strategy). 

However, we observe that if a NE of the game G represents a FDMA strategy, then the maximum 
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system sum rate is achieved using only single-user receiver at the AP. More specifically, if a NE p* 
represents the FDMA strategy, then there is at most a single user i G A/" transmitting on each channel 
k (i.e., with p*(A;) > 0). Define the index set I{k) = {i : p*{k) > 0}, we have that under a FDMA 
transmission strategy \T{k)\ < 1, V A: G /C. Clearly in this case, the sum rate of the users (when the AP 
uses single-user receiver) achieves the maximum sum rate of the network: 

V-pr* *\ V*iik)\hi{k)\^ \ 



i&N i&N k=l \ 

k=li£X{k) ^ 

(b) 



n{k) + Z,^^Pj{k)\h,ik)\ 



2 



n{k) 



J] ^log {n{k) + J] \h^{k)\^pl{k)^ - \og{n{k))^ = P{p*) (16) 

where both (a) and {b) are from the FDMA property of the NE p*. 

The question remains as in what situation does the NE of the game G represents the FDMA transmission 
strategy. In the next subsection, we provide an answer to this question by looking at the situation when 
the available spectrum is arbirarily finely divided, i.e., K ^ N. 

C. The Asymptotic Optimality of the NE 

In this subsection, we consider the asymptotic situation in which the available bandwidth is arbitrarily 
finely divided. In this case, the transmission rate for each CU i can be expressed as ll2ll : 

Jo \ n{w) + }2j^iPjH\hj{w)\^ J 

where the channel gain \hi{w)\'^ can be viewed as the channel transfer function for CU i to the AP; n{w) 

becomes the spectral density of the Gaussian noise experienced at the AP on channel k; pi denotes the 

transmit power spectral density of CU i, i.e., Pi{w) indicates the amount of power CU i transmits on 

frequency w. In this case, the sum power constraint of each CU should be expressed as: pi{w)dw < pi, 

and the feasible region Vi becomes: Vi = {pi : Jq pi{w)dw < pi,pi{w) > V w}. 

As before, a selfish CU i is interested in solving the following optimization problem: 

f\ I, , pdw)\hi{w)\'' \ ^ 

max / log IH ^- , ,., , ..^ dw. (18) 



From the definition of the NE and the solution to the CU's utility maximization problem (1181 ). individual 
equilibrium transmit spectral p* must satisfy: 



p*{w) 



V w. (19) 
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We have the following theorem characterizing the system equilibrium transmit spectral {p*}jg^. 

Theorem 2: When the available spectrum is arbitrarily finely divided, and the channel gains are 
generated according to some continuous distribution, then any NE of the game G represents a FDMA 
transmission strategy ( with probability 1 ). Moreover, any such NE is efficient, in the sense that the sum 
of individual users ' rates achieves the maximum system sum rate. 

Proof: We first show, by contradiction, that any NE represents the FDMA strategy. 

Suppose for some channel realization {\hi{w)\'^} i^j>j- , in the NE of the game a set of CUs M (1 M 
are using the frequency w. In another words, we assume the following: 



Pm{w) = 0-m . > 0, V m G 7W. 



(20) 



\hm{w)\'^ 

Then the following is true for all m ^ M: 

n{w) + p*{w)\hj{w)\'^ = am\hmiw)\'^ ■ (21) 
jeM 

Thus, for an arbitrary pair of CUs mi, m2 G M: (Tmi |/imi ('U^)P = <7m2 l^ma However, this 
equality is satisfied with probability zero (see the proof of Theorem 1 of 13]), because of the fact that 
and are constants, and that the channel coefficients are random variables drawn from continuous 
distributions (Rayleigh distribution or Rician distribution in fading channels). In summary, we claim that 
the equilibrium transmit power spectral {p*} follows a FDMA scheme with probability 1. 
Due to the above FDMA frequency allocation scheme, when p* (w) > 0, it must be true that: 



p*{w) 



n(vj) 



\K(w)\^ 

Consequently, we can have, for j / i (thus Pj{w) = 0): 

n{w) + \hi{w)\'^p*{w) 



n{w) 



(22) 



Pj [w) 



\h,{wW 
n{w) + \hi{w)\'^p*{w) 



< 



(23) 



where (a) is because of 



From Theorem 2 of ||25l . we know that the A^-user FDMA scheme 



maximizes the system sum-rate for a Gaussian multiple access channel if the following is true: 

-1 

Pi{w)dw = pi, V i G 



Pi{w) 



n(w) 
\hi{w)\^ 



0, 



if b,\hi{w)\^ > bj\hj{w)\^y j 
otherwise. 



(24) 
(25) 
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Comparing (|221)-(|23]| with (|24]|-(|25T|. we can readily identify that ai = hi, and we conclude that 
{Pi }jeA^ achieves the maximum system sum rate. ■ 

We see from the above derivation that the optimum channel assignment should take into consideration 
the following three factors ll26l : 1) users' channel quality; 2) users' power budget; 3) the noise power. 
The results derived in Theorem |2] is desirable because in practical multi-carrier systems (e.g. OFDM 
system), the number of channels is indeed very large compared with the number of users in the system. 
Consequently, the NE of the spectrum sharing game G represents a desirable outcome in which the CUs 
in the network share the spectrum efficiently. Interestingly, the authors of ||3l has shown that the NE for a 
uplink power control game represents a time-sharing strategy (which can be viewed as dual to our FDMA 
strategy), but in a 2-user fading channel system which is very different from the system we consider. 

Now the question becomes how such equilibrium point(s) can be reached by individual CU in a 
distributed fashion. In the next section, we provide three algorithms for such purpose. 

IV. The Proposed Algorithms and Convergence 

From the argument in the previous section, we see that finding the NE of the game G is equivalent 
to finding p* G argmaxpgp maxP(p). This is a convex problem and can be solved in a centralized 
way if all the parameters of the system (e.g., ||/ij(A;)p}, {pi]) are known. However, in a distributed 
environment, where the CUs are selfish, uncoordinated and not well informed of other CUs' channel 
coefficients and power budgets, it is not immediately clear how to find such NE point in a distributed 
fashion. 

A. Inapplicability of Conventional IWF Algorithm 

We first notice that our model of the network is a special case of a more general network with Gaussian 
interference channel that has been extensively studied recently, for example, in |[T8l . |[T9l . ||27l . ||28l . In 
those works, the CUs are transmitter-receiver pairs, and they are interested in allocating their limited 
transmission power on the set of channels /C to maximize their individual transmission rate. We refer 
to this network as a Peer-to-Peer (PP) network, while referring to our network as a Access Point (AP) 
network. In the PP network, we use \Hij{k)\'^ to denote the channel gain from CU i's transmitter to CU 
j's receiver on the k^^ channel; we use ni{k) to denote the environmental noise power at the receiver of 
CU i on channel k. An individual CU i, by transmiting Pi{k) on the k^^ channel, contributes to every 
other CU j / i in the network the amount of \Hij{k)\'^pi{k) interference at their respective receivers. 
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Now consider the scenario where all the CUs' receivers are co-located. In this case, for a particular 
CU i, the set of channel coefficients {\Hi,j{k)\'^}j=ii become equal to the value of \Hi^i{k)\'^; the set of 
environment noises {ni{k)}i(zj\f can be considered equal because the receivers are located at the same 
place. Consequently, the PP network is equivalent to the AP network. 

At this point, we might come to the conclusion that the distributed algorithms developed for the PP 
network automatically works in the AP network, after all, the PP case is more general than the AP case. 
However, we show in the following that this is not true. As a matter of fact, the sufficient conditions for 
the convergence of most algorithms proposed for the PP network are not satisfied in the AP network. As 
an example, we consider the sufficient condition for the simultaneous IWF algorithm proposed in ||27| . 
Define K nonnegative matrices H(A;) G M^^^ with their elements defined as follows: 



[ilkr{k) = { l^'-^'^)!^ (26) 
otherwise. 



Define another nonnegative matrix H™'^'^ G M^^^ as follows: 



[H]-xA^ --'^{feiF} ^'^^^ (27) 
otherwise. 



From Theorem 1 in 11271 . we have that the simultaneous IWFA algorithm converges to the unique NE 
of the game if the following is true: p(H™^^) < 1. In the following, we prove that in AP scenario, this 
condition can not be satisfied. 

From the Perron-Frobenius Theorem ||29l . we have that there must exist a x 1 vector w > 0, such 
that ||H™'^^||^ = /3(H™^^), where ||A||^ is the maximum norm of a matrix A, and is defined as follows: 

Q 

||A||^ ^max — VrAl^^-u;^, A G R^''^. (28) 

We next show that in the AP case, there could be no positive vector w satisfying ||H™^^||^ < 1. Note 
that we have for all A; G /C, < H(/c) < H"'^^ componentwise, which implies ||H(A;)||^ < HH"^^^!]^ 
( ll29l . Chapter 2, Proposition 6.2). Consequently, it is sufficient to prove that there exists k € JC, such 
that for all w > 0, we must have ||H(/c)||^ > 1. 

Choose any /c G /C. Suppose there exists w > such that ||H(A;)||^ < 1. This implies that: 
max, ^ fewF^^ < 1- Then it must be true that for every j G M, ^ few^^ < ^' 
which is equivalent to say that the following N inequalities are true simultaneously: 

I Hi J (k) \^Wi<\ H,j {k) \''wj,y j^M. (29) 
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Recall that when reduced to AP configuration, we have that for all j / i, \Hij[k)\'^ = \Hi^i{k)\'^. 
Using this equality and adding up N inequalities in (l29l ). we must have: 

{N-1)Y, \H^,^{k)\^W^ < J] \Hj,jik)\^Wj. (30) 

Because all the channel coefficients are greater than 0, the above inequality can not be satisfied for any 
w > 0. Consequently, we prove that there does not exist any w > such that ||H(A;)||^ < 1. Thus, we 
must have that ||H(/c)||^ > 1, which in turn says that for all > 0, we must have ||H™'^^||^ > 1, 
and this implies p(H™^^) > 1. We note further that since p(HJ^^^) < ||H™'^^|| for any norm (Prop.A.20 
in |[29l ). we must have that HH™*^^!! > 1 for arbitrary norm. Moreover, we can show similarly that for 
arbitrary norm, ||H(A:)|| > 1, V A:, and thus p{il{k)) > 1 V A;. 

In order to further explain the reason why, in general, algorithms for the PP configuration fail to 
work in our AP configuration, we observe that almost all the algorithms designed for PP configuration 
rely on some restrictive conditions of the channel gains to ensure the uniqueness of the equilibrium. 
For example, in lITSl . the condition p{'H.{k)) < 1, V A; G /C ensures the NE of the power allocation 
game is unique. However as we see in our previous argument, in the AP configuration such condition 
is not true for any realization of the channel gains. As a matter of fact, a straightforward consequence 
of Corollary [T]is that in general the AP configuration admits a (connected) set of equilibrium solutions, 
as the objective function of the optimization problem ([TOl i is concave, but not strictly concave. A simple 
example illustrates this point. 

Example 1: Consider the network with N = 2 CUs, K = 2 channels. Let = |/i2(l)p = 1, 

|/ii(2)p = |/i2(2)p = 2, n{l) = n(2) = 1, and let pi = p2 = 1- We can show that both the following 
two system power profiles p and p are the NE for the game related to this network: 

pi{l) = ^, pi(2) = ^; p2(l) = 0, pi(2) = 1; (31) 

and 

pi{l) = 0, pi{2) = 1; p2(l) = ^, pi{2) = ^. (32) 

Clearly, from the concavity of the potential function, all the convex combinations of the solutions p and 
p also maximize the potential function, hence they are also NEs of the game G. 

We conclude the above argument by saying that although the AP network indeed is a special case of 
the more general PP network, for which distributed algorithms have been developed to reach the NE, 
these algorithms may not be directly applicable to the AP scenario. Indeed, we will see later in the 
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simulation section, that by applying the simultaneous IWF algorithm directly to the AP network results 
in divergence. 



B. Proposed Algorithm based on IWF: Weak Convergence 

We now proceed to develop algorithms so that the CUs in the single AP network can distributedly 
reach the NE. In the following we propose two such algorithms. 

Algorithm 1: Averaged Iterative- Water Filling Algorithm (A-IWF): 

In each iteration t, the CUs do the following. 
1) Calculate the best reply power allocation: 

1 n,{k)+Y.^^^h,{k)\mk) ' 



where cji ensures Y^kaK^ ^'liv^^d = Pi' and let *i(p*„j) = [^>,Hp*„j), ■ 
2) Adjust their power profiles simultaneously according to: 



Pi ' ^ = (1 - "t)Pi + "t*i(p-i) 

where the sequence {a^}^^ satisfy at G (0, 1) and : 



lim > at = oo, lim > 

T-i^oo ^-^ 



T-s-oo 



a^ < oo. 



i=l t=i 
Algorithm 2: Sequential Iterative-Water Filling Algorithm (S-IWF): 

In each iteration t, the CUs adjust their power profiles sequentiallyn according to: 



(33) 



(34) 



(35) 



(36) 



The convergence properties of the above two algorithms are stated in the following two propositions. 

Proposition 2: If all the CUs in the network employ A-IWF algorithm, then their individual power 
profiles converge to the set of NE of game G. 



Proof: Define $(p) = [*i(p- 



: ^Af(P-Af)]^- Define s(p) = $(p) — p. Then from the system 



point of view the A-IWF algorithm can be written concisely as: 

p*+i = (1 - at)p* + at*(p*) = p* + ats(p*). 



(37) 



^By "sequential" we mean that the CUs in the set M take turns in changing their power allocation, and only a single CU 
gets to act at time t. All other CUs j ^ i, j & M keep their power allocation as in time t — 1. 
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We first introduce two lemmas. Tlie proof of Lemma H] can be found in Appendix lAl and we omit the 
proof of Lemma |2] for brevity. 

Lemma 1: There must exist a constant M, with < M < oo, such that s(p)''" VpP(p) > M||s(p)|p. 

Lemma 2: For two arbitrary vectors p £V and p GV, and for arbitrary norm \\.\\, there must exist 
two constants < D < oo, and < K < oo such that 

||s(p)-s(p)|| <D\\p-p\\ 
||vP(p)- VP(p)|| <K||p-p||. (38) 

In order to conform to the convention in convex optimization, we define the function F{p) = —P{p), 
and we see that F{p) is convex. 

Then from the well known Descent Lemma (Lemma 2. 1 in |[29l ). and Lemma |2] we have that: 



<F(p*)-a,M||s(p,)|P + -«?||s(p,)|P 



K 



= F{p')-at\Hpt)\\'{M -at-). (40) 

Because at goes to 0, then when t large enough, M — at^ > 0, and -F(p*) is monotonically decreasing. 
Combined with the fact that -F(p*) is lower bounded, then {P(p*)}^i is a convergent sequence. 
From (|39] l. we have that 

F{p^^') < F(p°) + x:«ts(p*)^vF(p*) + y: f «?iis(pt)ip. 

It is clear that I |s(pt)|p is upper bounded, and we have limy-xx. o^t < 00, so limr^oo Z^^o T"^* I l^(P*)l 1^ ^ 
00. Because limr_j.oo F(p^+^) converges, we must have 

T 

lim y^ats(p*)'''vF(p*) > -00 

t=0 



T 

T-)-oo ' 



lim J]ats(p*)TvP(p*) < 00. (41) 
From Lemma [T] we have 

T T 
M lim Vat||s(p*)||^ < lim V Qts(p*)T vP(p*) < cx). 

t=0 t=0 

Consequently it is clear that we must have liminff^oo ||s(p*)|| = 0. We show in the following that in 
fact we have a stronger result that lim^^oo l|s(p*)|| = 0. Suppose not, then limsup^^^o^ ||s(p*)|| > 0. In 
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this case there must exist a e > such that the subsequences {T(n) : | |s(p'^("))| | < e, | |s(p'^(")"'"^)| | > e} 
and {u(n) : e < ||s(p*)|| < 2e,V t G (T(n),u(n) - 1), ||s(p"("))|| > 2e} are both infinite. 
For a specific n, the following is true: 

||s(pr(n)+l)|| _ ||s(p^W)|| < L)||p^(")+1 - p^(")|| 

I |s(p-W)| I . (42) 

Thus, there exists a iV* such that for all n > N*, we must have ||s(p'^("))|| > |. 
We also have the following: 

e < ||s(p"("))|| - ||s(p^("))|| < Z)||p"(") -p^(")|| 

t=u{n)-~l t=n(n) — 1 

<D oct\\s{p')\\<D "t2e (43) 

t=T{n) t=T{n) 



which implies 



^ t=u{n) — l 

^< E (44) 



2D 

t=T(n) 



From our previous derivation, we also have limr^oo X]^o '^ill^(P*)ll^ ^ Then for any 5 > there 
must exists a N{6) such that for all n > N{5): YllZrP)"^ «tl|s(P*)lP < ^■ 
Take 5 = and take n > max | A^*, then we have 

2 t='u{n) — 1 t=tt(n) — 1 2 

7 E E »tMp')\\'<^ (45) 

t=T(n) t=T{n) 

which implies X]t=T(n)~^ — IS" "^^^^ ^ contradiction to (l44l ). Thus, we conclude that lim supj^^^^^ I |s(p* 
0, and consequently limf_j.oo ||s(p*)|| = 0. 

From limf_j.oo ||s(p*)|| = we see that the limit point pj^ of any converging subsequence of {p*}, 
say {p*"*}J^=i, must satisfy $(p^) = p^, which is sufficient condition to ensure that pj^ is a NE of 
the game G. Consequently pj^ must maximize the function P{p) (from Corollary [T), and this implies 
that the entire sequence {P(p*)}^]^ converges to the value P* = maxpgp ^'(p)- It also implies that the 
sequence {p*}^^ converge the set of NE of the game G, or in other words, every limit point of {p*}^]^ 
is a NE of the game G. ■ 

Proposition 3: If all the CUs in the network employ S-IWF algorithm, then their individual power 
profiles converge to the set of NE of game G. Moreover, the potential function (p*)}j^i is non- 
decreasing with respect to iteration step t, i.e., P{p^~^^) > -P(p*). 
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Proof: It is easy to see that this algorithm corresponds to the nonUnear Gauss-Seidel algorithm in 
solving constrained optimization problem ll29l . thus the general theory for the convergence of algorithm 
can be applied (e.g., |[29l Proposition 3.9). ■ 
The S-IWF algorithm is actually a simplification of the algorithm proposed in |[T6l . We introduce this 
algorithm here and briefly discuss its convergence analysis because it will be useful in our analysis in 
the second part of this paper. We need to point out here that the convergence behaviors characterized 
for A-IWF and S-IWF are set convergence, i.e., the distance between the sequence {p*}^i and the set 
of NE decreases to zero. Theoretically, it is possible that multiple limit points exist for such sequence, 
hence this convergence behavior is weaker than the "strong convergence", in which the sequence {p*}^]^ 
admits a single limit point in the set of NE. In practice though, convergence of the sequence {p*}^i is 
always observed However, for the sake of rigor, in the next subsection we propose a third algorithm 
which converges strongly to the set of NE. 

C. Proposed Algorithm based on Gradient Descent: Strong Convergence 

Algorithm 3: Projected Gradient Descent Algorithm: 

In each iteration t, the CUs do the following. 

1) Calculate the gradient of the potential function: 



Vp.P(p*) 



Ml)? \hi{K)\ 



2 



2) Adjust their power profiles simultaneously according to: 



(46) 



p*+i = [p* + atVp.P(p*)]^^ ^ *i(p*) (47) 

where the sequence {at}'^i satisfy at G (0, 1) and ( [35l ): the operator represents the projection on 
to the space Vi. 

Clearly, this algorithm is based on the classical projected gradient descent algorithm for solving 
nonlinear optimization problem, but with diminishing stepsize at- In order to prove the convergence 
of this algorithm, we first introduce the notion of Quasi-Fejer convergence |[30l . ||3TI . |[32l . 

Definition 1: A sequence {y*} is Quasi-Fejer convergent to a set U C if for every u £ U there is 
a sequence {e^} such that et > 0, Yl'S^o^t ^ °° ^"'^ lly*"*"^ ~ ''^IP ^ l|y* ~ ""IP + ^t, V t. 

The Quasi-Fejer sequence has the following property IIBTI . |[32l . 

'The S-IWF algorithm proposed in II16I for vector MAC channel also converges to the set of optimum points similarly as 
ours, and in practice it has been observed that such algorithm always converges to a single point. 
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Theorem 3: If {y*} is Quasi-Fejer convergent to a nonempty set U C R", then {y*} is bounded. If 
furthermore a limit point y* of {y*} belongs to U, then limfc_>.oo y'' = y*, i.e., the sequence converges 
to a single point in U. 

Using the notion of Quasi-Fejer convergence, we have the following strong convergence result for 
Algorithm 3. Please see Appendix IB] for proof. 

Proposition 4: The projected gradient descent algorithm is Quasi-Fejer convergent to the set of NE of 
game G, with error term et — 2at ('^'(p*) — p*)^ VpP(p*). Moreover, the sequence {p*}^]^ generated 
by this algorithm converges to a point in the set of NE. 

D. Discussion 

We first note that all the three algorithms proposed in the previous subsections can be carried out in a 
distributed fashion. That is, in order to carry out the computations in each iteration (mainly to compute 
$j(.) or of the algorithms, the CUs do not need to know the behavior of other CUs in the network. 

Instead, an individual CUs i only needs to know the aggregated interference plus noise (IPN) contributed 
by all other CUs on each channel: IPNi{k) = n{k) + \ ^ji^)\'^Pj(.^)i V A; G /C. As suggested by 
im, this information can be fed back to the CUs by the AP. In fact, the AP only needs to broadcast the 
quantity {n(A;) + X]«eA^ \^ji^)\'^Pji^)} keK CUs, and individual CU i can subtract its contribution 

and calculate {IPNi{k)}k^ic. We can also show that, similarly as in the previous two subsections, that 
a more general case of the algorithm where each CU i adopts different sequences of update coefficients 
(say {al}^i) also converges, as long as each sequence {aj}^^ satisfies the conditions in (1351 ). 

As stated previously, the theoretical categorization of the algorithms by their convergence behaviors 
is necessary, because it is generally not possible for the game G to have a single equilibrium point. 
Although for the algorithms in both categories, the potential function (or equivalently the sum capacity) 
converges to the single optimum point, the convergence behavior of the underlying CUs' strategies are 
more involved. Simply claiming the algorithm to be "convergent" might be too ambiguous and sometimes 
misleading 0. We observe that many iterative water-filling based algorithms for calculation of the capacity 
for vector MAC and broadcast channels, for example the algorithms in |[T3l . |[T6l and ll33l . can only be 
theoretically proven to be weakly convergent (in which the optimum capacity is attained in the limit, but 
the underlying sequence converges to the optimum set), although in practice they generally converges to 
a single optimum point. 

''indeed, in many situations convergence to a set leads to oscillation of the sequence. For example, the sequence {j + 
sin(0.57rf)}^i converges to the set {-1, 0, 1}. 
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For the descent algorithm, note that if the update step size is a constant, then the algorithm is also 
weakly convergent (see Prop. 3.4 of ll29l fl The descent algorithm with diminishing step size is also 
used in |[34ll for network utility maximization with feedback uncertainty, and the problem considered is 
very different from ours. We remark that, strong convergence does not imply fast convergence. Indeed, 
although we are able to show that the projected gradient decent algorithm converges strongly (which is 
theoretically appealing), in practice it tends to converge much slower than A-IWF and S-IWF. As such, 
in the second part of this paper, we will only choose A-IWF and S-IWF as building blocks for the joint 
AP selection and power allocation algorithm. 

V. Simulation Results 

In this section, we demonstrate the performance of the proposed algorithm. We have the following 
general settings for the simulation. We place multiple CUs and the AP randomly in a 10m x 10m area; 
we let (ij^ denote the distance between CU i and AP w, then the channel gains between CU i and AP 
w. Unless otherwise noted, {|/ij,ji)(^)P}fc6A:„ are independently drawn from an exponential distribution 
with mean -^j^ (i.e., |/ij,u,(A;)| is assumed to have Rayleigh distiibution). 

Fig. [T] shows a typical realization of the three algorithms analyzed in this paper, in a network with 10 
CUs and 32 channels. It is seen that the values of the potential function generated by these algorithms 
converge to the maximum system capacity quickly, but the sum rate of the CUs (hence individual power 
profiles) converges slowly for the projected gradient descent algorithm. 

Fig. |2] partly quantifies the convergence speed of different algorithms. In this figure, we compare 
the absolute difference between the maximum system sum rate and the values of the potential function 
generated by different algorithms (i.e., {|P(p*) — P*|}), in a network with 20 user and 64 channels. 
We observe that both the A-IWF and S-IWF algorithms converge relatively fast while the projected 
gradient descent algorithm, as seen in Fig|T] converges slowly. We have also studied the performance of 
simultaneous IWF algorithm ll27l . which clearly diverges in our single AP network. Such phenomenon 
has been partially explained in Section IIV-AI In Fig. [3j we characterize the convergence behavior of the 
sum of the CUs' rate R{t) = X^jgA^ ^«(P*)' ^'y plotting the relative difference between R(t) and R{WO): 
^^^^R{ioo) ^^^^ ■ Such metric can be viewed as related to the convergence speed of the algorithm. We see 
that for network with 128 channels and with increasing number of CUs, S-IWF converges increasingly 

^Consequently, most algorithms proposed for potential games based on projected gradient methods (e.g., those in II24I ') can 
also be categorized as weak convergence when the potential function is concave but not strictly concave. 
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Fig. 1. A particular realization of the algorithms with K=32 and N=10. 



slowly. Such behavior of the S-IWF is intuitively considering the sequential nature of the algorithm. We 
note that each point in both of these two figures is an average of 100 independent runs of the respective 
algorithms. 




Fig. 2. Averaged absolute difference between the maximum Fig. 3. Convergence behavior of the sum rate of CUs generated 
system capacity and the value of potential function generated by by A-IWF and S-IWF with K=128. 
different algorithms. K=32, N=10. 

In Section |III-C[ we have predicted that for a fixed number of CUs, when the number of channels 
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becomes large, the CUs tend to share the spectrum in a FDMA fashion, and the sum rate of the users 
approaches the maximum achievable system sum rate. Fig. 5] and Fig. |5] justify these claims. We say that 
a channel is collided if more than one CUs are using this channel. We say that a (event of) collision 
occurs if two CUs are using the same channel^ In Fig. |4l we plot the relationship between the number 
of channels in the system and the number of collided channels as well as the total number of collisions. 
Clearly, as the number of channels becomes large, both of the above quantities decreases. We also observe 
that when the number of channels becomes large, the number of collided channels tends to be the same 
as the total number of collisions, a phenomenon which implies that there tend to be no more than two 
CUs using a collided channel. In Fig. [51 we show the relative difference between the sum rate of the 
CUs after 200 iteration of the A-IWF algorithm and the maximum sum rate (i.e., 1 ^^ when 

the number of channels becomes large. The decreasing of such relative difference is an indication of 
increased efficiency of the spectrum sharing among the CUs. We note that each point in both of these 
two figures is again an average of 100 independent runs of the respective algorithms. 




Number of Channels Number ol Channels 



Fig. 4. Averaged number of collisions and averaged number of Fig. 5. Averaged relative difference between the sum rate of 
collided channels. the CUs and the maximum sum rate of the system. 

To quantify the overall efficiency of the spectrum sharing scheme, we plot the normalized system sum 
rate ^p^'^-' in Fig. |6] for the network with different number of CUs and different number of channels. 
Clearly the sharing scheme becomes more efficient when the number of channels becomes large. Notice, 
that in all the previous simulation experiments, we assume that the channel coefficients {hi{k)}k^ic 
of a particular CU i to be independent. This is true when the width of each channel is comparable 

*If n > 2 CUs are using the same channel, then there are a total number of ("~^)(") collisions occurred. 
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TABLE I 

Averaged Number of Collisions for Different Channel Coherent Bandwidth with A'' = 20. 





Independent 


Be = 0.1 


Be = 0.2 


Be = 0.5 


Be = 1 


Total Collision, K=300 


6.40 


7.91 


11.12 


11.60 


14.69 


Total Collided Channels, K=300 


6.13 


7.70 


10.06 


10.63 


13.90 


Total Collision, K=600 


4.31 


5.21 


6.00 


7.65 


12.67 


Total Collided Channels, K=600 


4.30 


5.07 


5.81 


7.61 


12.19 



to the coherent bandwidth, denoted as B^. f35). However, when we divide a fixed spectrum band with 
arbitrarily large number of channels, the coherent bandwidth eventually becomes larger than the channel 
width. Indeed, as mentioned in |[T3l . in practice the parallel frequency selective channels are usually 
correlated. As a result, in Fig JT] we study the spectrum sharing efficiency for a network with = 20 
CUs and with networks of different channel coherent bandwidth Be = {1, 0.5, 0.2, 0.1} (recall that 
our total available bandwidth is normalized to 1). For reference we also plot the case where the channels 
are assumed to be independent. We observe that large coherent bandwidth reduces the sharing efficiency. 
This phenomenon can be explained by noticing that when the channel becomes correlated, the event of 
collision is more likely to happen, as shown in Table H We again note that each point in both of these two 
figures and each entry in the table is an average of 100 independent runs of the respective algorithms. 
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Fig. 6. Comparison of the averaged spectrum sharing efficiency Fig. 7. Comparison of the averaged spectram sharing efficien- 
to the number of channels, N=[15,20,25,30]. cies when the coherent bandwidth is different. N=20. 





^N=20 


ndependenl 


-a-N=20, 


B^=0.1 


-^N=20 


B^=0.2 


-A-N=20 


B^=0.5 


-e^N=20 


8^=1 
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VI. Conclusion 

In this first part of the paper, we formulate the upUnk spectrum sharing problem in a single AP CRN 
into a non-cooperative game framework. We identify that this game belongs to the family of games 
called the "potential games", and we characterize the properties of the proposed game. We then propose 
three algorithms with different convergence properties that allows the CUs in the network to access the 
spectrum in a distributed fashion. From simulation we see that the proposed algorithms are able to reach 
the equilibria of the spectrum sharing game, which represent a set of efficient spectrum sharing strategies. 

In the next part of the paper, we will study jointly the spectrum sharing and spectrum decision problem 
in a CRN with multiple APs. We will see how the algorithms developed in this part of the paper can be 
used for constructing efficient and distributed joint spectrum decision and spectrum sharing strategies. 



Appendix A 
Proof of Lemma [T] 
Proof: We first prove Lemma [T] We need to show that the following is true: 



TV K 



N K 



i=l k=l i=l k=l 

where (p) = <I>J^(p_j) — Pi{k). It is sufficient to show that for all i G Af, there must exist a constant 
< Mi < oo such that: J2k=i Vp.(fc)-P(p)Sj^(p) > Efcli (■sf (p))^ A^j. In the following, we will set out 
to prove that for all i G M, there must exist a Mi with < Mi < oo, such that: 

K 



fc=l 



We notice that 



'Piik) 



PiP) 



5.'(P) 



1 n{k)+j:^^^\h,{k)\^p,{k) 



Pi{k). 



(49) 

(50) 
(51) 



We also observe the following equality: 



K 



K 



K 



(52) 



E (p) = E *'(p-o - E^''(^) =p^~ = 0- 

k=l k=l k=l 

This can be readily concluded from our previous observation that from each user's point of view, it is 
beneficial to allocate all its power for communication. Using (l52l ). we see that in order to prove (|49] l. it 
is sufficient to prove that for all fci G /C and ^2 G /C such that 



s^(p) > 0, and s'^'{p) < 



(53) 
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there exists < M^'"''''"-' < oo such that: 

Vp,(,,)P(p) - (p) > Vp,(,,)P(p) - Mf I'^-^^s^^lp) (54) 

^Vp.(,,)P(p) - Vp.(,,)P(p) > Mf ^'"'^ (4^nP) - S^(P)) > 0. (55) 
If the above is true, we can take Mj = min^i^fc^ j^'^'^^^'^^ ^ thg^ for all k\, k2 that satisfies (l53l) . we have 

Vp,(fc,)P(p) - Vp.(fc,)P(p) > (sfHp) - s^(p)) 
^Vp,(,,)P(p) - M,s^(p) > Vp,(fe,)P(p) - M,s^(p). (56) 

Consequently, ( |49l ) can be established. 

Let us look at the term s^^{p) first. Let us simpUfy the notation by denoting s(''(p) = [A^^]^""'""''^''^^ - 
Pi{ki), where A^' = ^ - "^^''^^^|t'(fcOp'^''^'^^^'^ " Because (p) > 0, we must have that A'^' > 
Pi{ki) > 0, consequently, we have: 

< [4^]^-'=^'^^ < Af^ (57) 

We then look at the term s*'^(p). We can, similarly as above, also simplify it as sf^(p) = _ 
Pi{k2). Because s^^(p) < 0, we must have that A'I^ < Pi{k2) < Pmask{k2), consequently, we have: 

> K'lr"'"'"'' -P»(^2) > -Mk^). (58) 

As a result of (l57l ) and (|58] ). in order to prove (l54l . it is sufficient to prove that there exists < 

j^{kl,k2) ^ ^ ^^^^ 

Vp.(fe,)P(p)-A/fi^'^2'x(4^-p,(A:i)) 

> Vp.(,,)P(p) - M^"''"'^ X (A^ - Mk2)). (59) 
We see that ( [59l l is equivalent to 

Mki)\^ ^^iki,k2) ( 1 ^(fci) + Ej'Lil^j(fci)Ppj-(fci) ^ 



> |fe.(fc2)P ^,^(fcl,fe2) _ »(fc2)+E,'Ll|fe.(fc2)Pp,-(fc2) \ 

" n(fc2)+Ef=l|/l.(fc2)Pp,(fc2) * Mk2)? )' 

Define B^^ = ,, , „l^'^i^^'*J, — tt^, and similarly, we have that the above inequality can be 
simplified to: 



Qki _ j^jikl,k2) _L J_ ^ ^k2 _ ^^(fel,fe2) 



1 1 



^f^-^'^> (;^-;^) A^f'''''^- (61) 
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Now notice that: 

- (aI^ - p,{k,)) + - = and -(A^-p,(fc: 



11 



(62) 



and we have from (ISTT i and (ISST i that 



o>-s^(p)>-(A^-p,^). (63) 



We have that < and ' > i?f ' . Consequently, is equivalent to 



Mik^M) < ^ ^ ^fc2 X ^fei. (64) 



Now it is clear that we can always find such a < M- ' < oo, that satisfies the above inequality, 
because the fact that = ^L^'^i^^Lm, — ttt is always bounded above and strictly greater than 
(|/ii(A:)|2 > V i G A: G /C, n{k) >0y kelC). 

Now that we can always find < Mf'^'^'^^ < oo that satisfies (l54l ). from the argument above, we can 
see that < Mi = mink^^f^^^x: M^'^^'^'^^ < oo must satisfy ( |49l ). Thus, take M = minjg^Mj, we have 
that O is true. Thus, the proposition is proved. ■ 

Appendix B 
Proof of Proposition H] 

Proof: The projected gradient algorithm can be written as: p*"*"^ = [p* + afVpP(p*)]p = ^'(p*), 
where V = YiieAf'Pi- ^^^^ show that at least one limit point of the sequence p* is a NE of the game 
G. From the Projection Theorem ( ||29l Sec 3.3 Prop. 3.2) we have that: 

(*(p*) - p* - atVpP(p*))^ (p* - *(p*)) > 0. (65) 

Consequently, we have: 

< ||*(p*) - p*||2 < at (*(p*) - p*)' VpP(p*). (66) 
Similarly as in (|39] l. we invoke the descent lemma: 

F(*(p*)) < F{p*) + (*(p*) - p*)TVpF(p*) + y ||*(p*) - p*|p 

2 at 

< F(p*) + {-at - l)at||VpP(p*)||2 (67) 
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where (o) is from (l66l ): (6) is because of the non-expansiveness of the projection operator: 

II*(P*)-P*IP- ||[p* + «tVpP(p')]^- [p%||' 

<«?||VpP(p*)||2. (68) 

Thus there must exist a time T* such that V t > T*, F(*(p*)) < F(p*). From the fact that the function 
F(p) is lower bounded, we must have that the sequence -F(p*) converges. An immediate consequence 
of this result (cf. equation (1411 ) is that: 

oo 

^(*(p*)-p*)TVpP(p*) <cx). (69) 

t=i 

Let p* be a limit point of the sequence {p*}, then we must have that *(p*) = p*. This fact combined 
with the projection theorem implies that for any y G the following is true: 

> (y - *(p*))^ (p* + atVpP(p*) - *(p*)) 
= at(y-p*)'VpP(p*). (70) 

The last inequality shows that p* G argmaxpgp P(p), and consequently, p* is a NE of the game G. 

We then show that the sequence {p*} is Quasi-Fejer convergent to the set of NE. Using again the 
Projection Theorem, and (with a little abuse of notation) take p* to be any NE solution, we have: 

< (p* - ^{v')y (*(p*) - p' - a,VpF(p*)) 
= (p*-p*)^(*(p*)-p*-a*VpP(p')) 

+ (p* - *(p*))^ (*(p*) - p* - atVpP(p*)) . (71) 

This is equivalent to: 

(p*-p')^(*(p*)-p*) 

> at (p* - p*)' VpP(p') + ||*(p*) - p*||2 + at (p* - *(p'))' VpP(p') 

> ||*(p*)-p*||2+at(p*-*(p*))^VpP(p*) (72) 

where (a) is because of the fact that P(p) is concave: (p* - p*)^ VpP(p*) > P(p*) - -P(p*) > 0. The 
distance between p* and a arbitrary vector p*"*"^ can be expressed as follows: 

||p*-p*+i||2 

= Hp* - P*IP + Hp* - p*+'IP - 2 (p* - p*)' (p*"*"^ - p*) 

< Hp* - p*||2 - ||vl/(p') - p*||2 + 2at (*(p*) - p*)^ VpP(p') 

< ||p*-p*|p+2a,(*(p*)-p*)^VpP(p*) (73) 
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where (a) is from (l72]l and the definition of that *(p*) = p*+^. Now let us take = 2at (*(p*) - p*)^ VpP( 
Then we have:||p* — p*^^|p < ||p* — p*|p + et. From (l66l ) and (l69l ) we conclude {ej}^]^ is non-negative 
and summable sequence. Because p* is an arbitrary NE point, from Definition [T] the sequence {p*} is 
Quasi-Fejer convergent to the set of NE of game G. The first part of this proof show that a limit point 
of {p*} belongs to the set of NE, consequently, by applying Theorem |3j we see that {p*} converges to 
a point in the set of NE. ■ 
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